Abusive Content Detection in Arabic Tweets Using Multi-Task Learning and Transformer-Based Models

نویسندگان

چکیده

Different social media platforms have become increasingly popular in the Arab world recent years. The increasing use of media, however, has also led to emergence a new challenge form abusive content, including hate speech, offensive language, and language. Existing research work focuses on automatic content detection as binary classification problem. In addition, existing task surrounding Arabic fails tackle dialect-specific phenomenon. Consequently, this two important issues task. study, we used multi-aspect annotation schema problem countries, based multi-class dialectal (DA)-specific More precisely, includes five attributes: directness, hostility, target, group, annotator. We specifically developed framework automatically detecting Twitter using natural language processing (NLP) techniques. different models machine learning (ML), deep (DL), pretrained (LMs) dataset. investigate impact other approaches, such multi-task (MTL), four MTL built top DA model (called MARBERT) trained Our LMs enhanced performance compared DL mentioned literature.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Abusive Language Detection on Arabic Social Media

In this paper, we present our work on detecting abusive language on Arabic social media. We extract a list of obscene words and hashtags using common patterns used in offensive and rude communications. We also classify Twitter users according to whether they use any of these words or not in their tweets. We expand the list of obscene words using this classification, and we report results on a n...

متن کامل

Abusive Language Detection in Online User Content

Detection of abusive language in user generated online content has become an issue of increasing importance in recent years. Most current commercial methods make use of blacklists and regular expressions, however these measures fall short when contending with more subtle, less ham-fisted examples of hate speech. In this work, we develop a machine learning based method to detect hate speech on o...

متن کامل

Transferring Multi-device Localization Models using Latent Multi-task Learning

In this paper, we propose a latent multi-task learning algorithm to solve the multi-device indoor localization problem. Traditional indoor localization systems often assume that the collected signal data distributions are fixed, and thus the localization model learned on one device can be used on other devices without adaptation. However, by empirically studying the signal variation over differ...

متن کامل

Multi-objective Based Optimization Using Tap Setting Transformer, DG and Capacitor Placement in Distribution Networks

In this article, a multi-objective function for placement of Distributed Generation (DG) and capacitors with thetap setting of Under Load Tap Changer (ULTC) Transformer is introduced. Most of the recent articles have paidless attention to DG, capacitor placement and ULTC effects in the distribution network simultaneously. Insimulations, a comparison between different modes was carried out with,...

متن کامل

Using Machine Learning Algorithms for Automatic Cyber Bullying Detection in Arabic Social Media

Social media allows people interact to express their thoughts or feelings about different subjects. However, some of users may write offensive twits to other via social media which known as cyber bullying. Successful prevention depends on automatically detecting malicious messages. Automatic detection of bullying in the text of social media by analyzing the text "twits" via one of the machine l...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2023

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app13105825